ABSTRACT
Institutions of higher education (IHEs) have been a focus of SARS-CoV-2 transmission studies but there is limited information on how viral diversity and transmission at IHEs changed as the pandemic progressed. Here we analyze 3606 viral genomes from unique COVID-19 episodes collected at a public university in Seattle, Washington (WA) from September 2020 to September 2022. Across the study period, we found evidence of frequent viral transmission among university affiliates with 60% (n=2153) of viral genomes from campus specimens genetically identical to at least one other campus specimen. Moreover, viruses from students were observed in transmission clusters at a higher frequency than in the overall dataset while viruses from symptomatic infections were observed in transmission clusters at a lower frequency. Though only a small percentage of community viruses were identified as possible descendants of viruses isolated in university study specimens, phylodynamic modelling suggested a high rate of transmission events from campus into the local community, particularly during the 2021-2022 academic year. We conclude that viral transmission was common within the university population throughout the study period but that not all university affiliates were equally likely to be involved. In addition, the transmission rate from campus into the surrounding community may have increased during the second year of the study, possibly due to return to in-person instruction.
Subject(s)
COVID-19ABSTRACT
Public health researchers and practitioners commonly infer phylogenies from viral genome sequences to understand transmission dynamics and identify clusters of genetically-related samples. However, viruses that reassort or recombine violate phylogenetic assumptions and require more sophisticated methods. Even when phylogenies are appropriate, they can be unnecessary or difficult to interpret without specialty knowledge. For example, pairwise distances between sequences can be enough to identify clusters of related samples or assign new samples to existing phylogenetic clusters. In this work, we tested whether dimensionality reduction methods could capture known genetic groups within two human pathogenic viruses that cause substantial human morbidity and mortality and frequently reassort or recombine, respectively: seasonal influenza A/H3N2 and SARS-CoV-2. We applied principal component analysis (PCA), multidimensional scaling (MDS), t-distributed stochastic neighbor embedding (t-SNE), and uniform manifold approximation and projection (UMAP) to sequences with well-defined phylogenetic clades and either reassortment (H3N2) or recombination (SARS-CoV-2). For each low-dimensional embedding of sequences, we calculated the correlation between pairwise genetic and Euclidean distances in the embedding and applied a hierarchical clustering method to identify clusters in the embedding. We measured the accuracy of clusters compared to previously defined phylogenetic clades, reassortment clusters, or recombinant lineages. We found that MDS maintained the strongest correlation between pairwise genetic and Euclidean distances between sequences and best captured the intermediate placement of recombinant lineages between parental lineages. Clusters from t-SNE most accurately recapitulated known phylogenetic clades and recombinant lineages. Both MDS and t-SNE accurately identified reassortment groups. We show that simple statistical methods without a biological model can accurately represent known genetic relationships for relevant human pathogenic viruses. Our open source implementation of these methods for analysis of viral genome sequences can be easily applied when phylogenetic methods are either unnecessary or inappropriate.
ABSTRACT
Genomic surveillance of pathogen evolution is essential for public health response, treatment strategies, and vaccine development. In the context of SARS-COV-2, multiple models have been developed including Multinomial Logistic Regression (MLR) describing variant frequency growth as well as Fixed Growth Advantage (FGA), Growth Advantage Random Walk (GARW) and Piantham parameterizations describing variant Rt. These models provide estimates of variant fitness and can be used to forecast changes in variant frequency. We introduce a framework for evaluating real-time forecasts of variant frequencies, and apply this framework to the evolution of SARS-CoV-2 during 2022 in which multiple new viral variants emerged and rapidly spread through the population. We compare models across representative countries with different intensities of genomic surveillance. Retrospective assessment of model accuracy highlights that most models of variant frequency perform well and are able to produce reasonable forecasts. We find that the simple MLR model provides ~0.6% median absolute error and ~6% mean absolute error when forecasting 30 days out for countries with robust genomic surveillance. We investigate impacts of sequence quantity and quality across countries on forecast accuracy and conduct systematic downsampling to identify that 1000 sequences per week is fully sufficient for accurate short-term forecasts. We conclude that fitness models represent a useful prognostic tool for short-term evolutionary forecasting.
Subject(s)
Seizures , Refractive ErrorsABSTRACT
Through antigenic evolution, viruses like seasonal influenza evade recognition by neutralizing antibodies elicited by previous infection or vaccination. This means that a person with antibodies well-tuned to an initial infection will not be protected against the same virus years later and that vaccine-mediated protection will decay. It is not fully understood which of the many endemic human viruses evolve in this fashion. To expand that knowledge, we assess adaptive evolution across the viral genome in 28 endemic viruses, spanning a wide range of viral families and transmission modes. We find that surface proteins consistently show the highest rates of adaptation, and estimate that ten viruses in this panel undergo antigenic evolution to selectively fix mutations that enable the virus to escape recognition by prior immunity. We compare overall rates of amino acid substitution between these antigenically-evolving viruses and SARS-CoV-2, showing that SARS-CoV-2 viruses are accumulating protein-coding changes at substantially faster rates than these endemic viruses.
ABSTRACT
SARS-CoV-2 transmission is largely driven by heterogeneous dynamics at a local scale, leaving local health departments to design interventions with limited information. We analyzed SARS-CoV-2 genomes sampled between February 2020 and March 2022 jointly with epidemiological and cell phone mobility data to investigate fine scale spatiotemporal SARS-CoV-2 transmission dynamics in King County, Washington, a diverse, metropolitan US county. We applied an approximate structured coalescent approach to model transmission within and between North King County and South King County alongside the rate of outside introductions into the county. Our phylodynamic analyses reveal that following stay-at-home orders, the epidemic trajectories of North and South King County began to diverge. We find that South King County consistently had more reported and estimated cases, COVID-19 hospitalizations, and longer persistence of local viral transmission when compared to North King County, where viral importations from outside drove a larger proportion of new cases. Using mobility and demographic data, we also find that South King County experienced a more modest and less sustained reduction in mobility following stay-at-home orders than North King County, while also bearing more socioeconomic inequities that might contribute to a disproportionate burden of SARS-CoV-2 transmission. Overall, our findings suggest a role for local-scale phylodynamics in understanding the heterogeneous transmission landscape. One Sentence SummaryAnalysis of SARS-CoV-2 genomes in King County, Washington show that diverse areas in the same metropolitan region can have different epidemic dynamics.
Subject(s)
COVID-19ABSTRACT
ImportanceFew US studies have reexamined risk factors for SARS-CoV-2 positivity in the context of widespread vaccination and new variants or considered risk factors for co-circulating endemic viruses, such as rhinovirus. ObjectiveTo understand how risk factors and symptoms associated with SARS-CoV-2 test positivity changed over the course of the pandemic and to compare these to the factors associated with rhinovirus test positivity. DesignThis test-negative design study used multivariable logistic regression to assess associations between SARS-CoV-2 and rhinovirus test positivity and self-reported demographic and symptom variables over a 22-month period. SettingKing County, Washington, June 2020-April 2022 Participants23,278 symptomatic individuals of all ages enrolled in a cross-sectional community surveillance study. ExposuresSelf-reported data for 15 demographic and health behavior variables and 16 symptoms. Main Outcome(s) and Measure(s)RT-PCR confirmed SARS-CoV-2 or rhinovirus infection. ResultsClose contact with a SARS-CoV-2 case (adjusted odds ratio, aOR 4.3, 95% CI 3.7-5.0) and loss of smell/taste (aOR 3.7, 95% CI 3.0-4.5) were the variables most associated with SARS-CoV-2 test positivity, but both attenuated during the Omicron period. Contact with a vaccinated case (aOR 2.4, 95% CI 1.7-3.3) was associated with a lower odds of test positivity than contact with an unvaccinated case (aOR 4.4, 95% CI 2.7-7.3). Sore throat was associated with Omicron infection (aOR 2.3, 95% CI 1.6-3.2) but not Delta. Vaccine effectiveness for participants fully vaccinated with a booster dose was 43% (95% CI 11-63%) for Omicron and 92% (95% CI 61-100%) for Delta. Variables associated with rhinovirus test positivity included age <12 years (aOR 4.0, 95% CI 3.5-4.6) and reporting a runny or stuffy nose (aOR 4.6, 95% CI 4.1-5.2). Race, region, and household crowding were significantly associated with both SARS-CoV-2 and rhinovirus test positivity. Conclusions and RelevanceEstimated risk factors and symptoms associated with SARS-CoV-2 infection have changed over time. There was a shift in reported symptoms between the Delta and Omicron variants as well as reductions in the protection provided by vaccines. Racial and socioeconomic disparities persisted in the third year of SARS-CoV-2 circulation and were also present in rhinovirus infection, although the causal pathways remain unclear. Trends in testing behavior and availability may influence these results. Key Points QuestionWhat are the characteristics associated with SARS-CoV-2 and rhinovirus infection? FindingsIn this test-negative design study of 23,278 participants, reporting close contact with a SARS-CoV-2 case was the strongest risk factor associated with test positivity. Loss of smell and taste was associated with the Delta variant, but not the Omicron variant. Vaccination and prior infection provided greater protection against Delta infection than Omicron Infection. Young age was the strongest predictor of rhinovirus positivity. Sociodemographic disparities were present for both SARS-CoV-2 and rhinovirus. MeaningMonitoring factors associated with respiratory pathogen test positivity remains important to identify at-risk populations in the post-SARS-CoV-2 pandemic period.
Subject(s)
COVID-19 , Severe Acute Respiratory Syndrome , InfectionsABSTRACT
Spatial properties of tumor growth have profound implications for cancer progression, therapeutic resistance and metastasis. Yet, how spatial position governs tumor cell division remains difficult to evaluate in clinical tumors. Here, we demonstrate that elevated cellular growth rates on the tumor periphery leave characteristic patterns in the genomes of cells sampled from different parts of a tumor, which become evident when they are used to construct a tumor phylogenetic tree. Namely, rapidly-dividing peripheral lineages branch more extensively and acquire more mutations than slower-dividing lineages in the tumor center. We develop a Bayesian state-dependent evolutionary phylodynamic model (SDevo) that quantifies these patterns to infer the differential cell division rates between peripheral and central cells jointly from the branching and mutational patterns of single-time point, multi-region sequencing data. We validate this approach on simulated tumors by demonstrating its ability to accurately infer spatially-varying birth rates under a range of growth conditions and sampling strategies. We then show that SDevo outperforms state-of-the-art, non-cancer multi-state phylodynamic methods which ignore differential mutational acquisition. Finally, we apply SDevo to multi-region sequencing data from clinical hepatocellular carcinomas and find evidence that cells on the tumor edge divide 2-4x faster than those in the center. As multi-region and single-cell sequencing increase in resolution and availability, we anticipate that SDevo will be useful in interrogating spatial restrictions on tumor growth and could be extended to model non-spatial factors that influence tumor progression, including hypoxia and immune infiltration.
Subject(s)
Neoplasms , Carcinoma, Hepatocellular , Growth Disorders , HypoxiaABSTRACT
Novel variants continue to emerge in the SARS-CoV-2 pandemic. University testing programs may provide timely epidemiologic and genomic surveillance data to inform public health responses. We conducted testing from September 2021 to February 2022 in a university population under vaccination and indoor mask mandates. A total of 3,048 of 24,393 individuals tested positive for SARS-CoV-2 by RT-PCR; whole genome sequencing identified 209 Delta and 1,730 Omicron genomes of the 1,939 total sequenced. Compared to Delta, Omicron had a shorter median serial interval between genetically identical, symptomatic infections within households (2 versus 6 days, P=0.021). Omicron also demonstrated a greater peak reproductive number (2.4 versus 1.8) and a 1.07 (95% confidence interval: 0.58, 1.57; P<0.0001) higher mean cycle threshold value. Despite near universal vaccination and stringent mitigation measures, Omicron rapidly displaced the Delta variant to become the predominant viral strain and led to a surge in cases in a university population.
ABSTRACT
Investment in Africa over the past year with regards to SARS-CoV-2 genotyping has led to a massive increase in the number of sequences, exceeding 100,000 genomes generated to track the pandemic on the continent. Our results show an increase in the number of African countries able to sequence within their own borders, coupled with a decrease in sequencing turnaround time. Findings from this genomic surveillance underscores the heterogeneous nature of the pandemic but we observe repeated dissemination of SARS-CoV-2 variants within the continent. Sustained investment for genomic surveillance in Africa is needed as the virus continues to evolve, particularly in the low vaccination landscape. These investments are very crucial for preparedness and response for future pathogen outbreaks.
ABSTRACT
Background Co-circulating respiratory pathogens can interfere with or promote each other, leading to important effects on disease epidemiology. Estimating the magnitude of pathogen-pathogen interactions from clinical specimens is challenging because sampling from symptomatic individuals can create biased estimates. Methods We conducted an observational, cross-sectional study using samples collected by the Seattle Flu Study between 11 November 2018 and 20 August 2021. Samples that tested positive via RT-qPCR for at least one of 17 potential respiratory pathogens were included in this study. Semi-quantitative cycle threshold (Ct) values were used to measure pathogen load. Differences in pathogen load between monoinfected and coinfected samples were assessed using linear regression adjusting for age, season, and recruitment channel. Results 21,686 samples were positive for at least one potential pathogen. Most prevalent were rhinovirus (33·5%), Streptococcus pneumoniae ( SPn , 29·0%), SARS-CoV-2 (13.8%) and influenza A/H1N1 (9·6%). 140 potential pathogen pairs were included for analysis, and 56 (40%) pairs yielded significant Ct differences (p < 0.01) between monoinfected and co-infected samples. We observed no virus-virus pairs showing evidence of significant facilitating interactions, and found significant viral load decrease among 37 of 108 (34%) assessed pairs. Samples positive with SPn and a virus were consistently associated with increased SPn load. Conclusions Viral load data can be used to overcome sampling bias in studies of pathogen-pathogen interactions. When applied to respiratory pathogens, we found evidence of viral- SPn facilitation and several examples of viral-viral interference. Multipathogen surveillance is a cost-efficient data collection approach, with added clinical and epidemiological informational value over single-pathogen testing, but requires careful analysis to mitigate selection bias.
Subject(s)
Influenza, Human , Pneumococcal InfectionsABSTRACT
Accurately estimating relative transmission rates of SARS-CoV-2 variants remains a scientific and public health priority. Recent studies have used the sample proportions of different variants from genetic sequence data to describe variant frequency dynamics and relative transmission rates, but frequencies alone cannot capture the rich epidemiological behavior of SARS-CoV-2. Here, we extend methods for inferring the effective reproduction number of an epidemic using confirmed case data to jointly estimate variant-specific effective reproduction numbers and frequencies of cocirculating variants using cases and sequences across states in the US from January 2021 to March 2022. Our method can be used to infer structured relationships between effective reproduction numbers across time series allowing us to estimate fixed variant-specific growth advantages. We use this model to estimate the effective reproduction number of SARS-CoV-2 Variants of Concern and Variants of Interest in the United States and estimate consistent growth advantages of particular variants across different locations.
ABSTRACT
BackgroundThe COVID-19 pandemic is now dominated by variant lineages; the resulting impact on disease severity remains unclear. Using a retrospective cohort study, we assessed the risk of hospitalization following infection with nine variants of concern or interest (VOC/VOI). MethodsOur study includes individuals with positive SARS-CoV-2 RT-PCR in the Washington Disease Reporting System and with available viral genome data, from December 1, 2020 to July 30, 2021. The main analysis was restricted to cases with specimens collected through sentinel surveillance. Using a Cox proportional hazards model with mixed effects, we estimated hazard ratios (HR) for the risk of hospitalization following infection with a VOC/VOI, adjusting for age, sex, and vaccination status. FindingsOf the 27,814 cases, 23,170 (83.3%) were sequenced through sentinel surveillance, of which 726 (3.1%) were hospitalized due to COVID-19. Higher hospitalization risk was found for infections with Gamma (HR 3.17, 95% CI 2.15-4.67), Beta (HR: 2.97, 95% CI 1.65-5.35), Delta (HR: 2.30, 95% CI 1.69-3.15), and Alpha (HR 1.59, 95% CI 1.26-1.99) compared to infections with an ancestral lineage. Following VOC infection, unvaccinated patients show a similar higher hospitalization risk, while vaccinated patients show no significant difference in risk, both when compared to unvaccinated, ancestral lineage cases. InterpretationInfection with a VOC results in a higher hospitalization risk, with an active vaccination attenuating that risk. Our findings support promoting hospital preparedness, vaccination, and robust genomic surveillance.
Subject(s)
COVID-19 , Disease , InfectionsABSTRACT
Despite the appearance of variant SARS-CoV-2 viruses with altered receptorbinding or antigenic phenotypes, traditional methods for detecting adaptive evolution from sequence data do not pick up strong signals of positive selection. Here, we present a new method for identifying adaptive evolution on short evolutionary time scales with densely-sampled populations. We apply this method to SARS-CoV-2 to perform a comprehensive analysis of adaptively-evolving regions of the genome. We find that spike S1 is a focal point of adaptive evolution, but also identify positively-selected mutations in other genes that are sculpting the evolutionary trajectory of SARS-CoV-2. Protein-coding mutations in S1 are temporally-clustered and, in 2021, the ratio of nonsynonymous to synonymous divergence in S1 is more than 4 times greater than in the equivalent influenza HA1 subunit.
ABSTRACT
As shown during the SARS-CoV-2 pandemic, phylogenetic and phylodynamic methods are essential tools to study the spread and evolution of pathogens. One of the central assumptions of these methods is that the shared history of pathogens isolated from different hosts can be described by a branching phylogenetic tree. Recombination breaks this assumption. This makes it problematic to apply phylogenetic methods to study recombining pathogens, including, for example, coronaviruses. Here, we introduce a Markov chain Monte Carlo approach that allows inference of recombination networks from genetic sequence data under a template switching model of recombination. Using this method, we first show that recombination is extremely common in the evolutionary history of SARS-like coronaviruses. We then show how recombination rates across the genome of the human seasonal coronaviruses 229E, OC43 and NL63 vary with rates of adaptation. This suggests that recombination could be beneficial to fitness of human seasonal coronaviruses. Additionally, this work sets the stage for Bayesian phylogenetic tracking of the spread and evolution of SARS-CoV-2 in the future, even as recombinant viruses become prevalent.
ABSTRACT
BackgroundTesting programs have been utilized as part of SARS-CoV-2 mitigation strategies on university campuses, and it is not known which strategies successfully identify cases and contain outbreaks. ObjectiveEvaluation of a testing program to control SARS-CoV-2 transmission at a large university. DesignProspective longitudinal study using remote contactless enrollment, daily mobile symptom and exposure tracking, and self-swab sample collection. Individuals were tested if the participant was (1) exposed to a known case, developed new symptoms, or reported high-risk behavior, (2) a member of a group experiencing an outbreak, or (3) at baseline upon enrollment. SettingAn urban, public university during Autumn quarter of 2020 ParticipantsStudents, staff, and faculty. MeasurementsSARS-CoV-2 PCR testing was conducted, and viral genome sequencing was performed. ResultsWe enrolled 16,476 individuals, performed 29,783 SARS-CoV-2 tests, and detected 236 infections. Greek community affiliation was the strongest risk factor for testing positive. 75.0% of positive cases reported at least one of the following: symptoms (60.8%), exposure (34.7%), or high-risk behaviors (21.5%). 88.1% of viral genomes (52/59) sequenced from Greek-affiliated students were genetically identical to at least one other genome detected, indicative of rapid SARS-CoV-2 spread within this group, compared to 37.9% (11/29) of genomes from non-Greek students and employees. LimitationsObservational study. ConclusionIn a setting of limited resources during a pandemic, we prioritized testing of individuals with symptoms and high-risk exposure during outbreaks. Rapid spread of SARS- CoV-2 occurred within outbreaks without evidence of further spread to the surrounding community. A testing program focused on high-risk populations may be effective as part of a comprehensive university-wide mitigation strategy to control the SARS-CoV-2 pandemic.
ABSTRACT
Real-time epidemiological tracking of variants of interest can help limit the spread of more contagious forms of SARS-CoV-2, such as those containing the N501Y mutation. Typically, genetic sequencing is required to be able to track variants of interest in real-time. However, sequencing can take time and may not be accessible in all laboratories. Genotyping by RT-ddPCR offers an alternative to sequencing to rapidly detect variants of concern through discrimination of specific mutations such as N501Y that is associated with increased transmissibility. Here we describe the first cases of the B.1.1.7 lineage of SARS-CoV-2 detected in Washington State by using a combination of RT-PCR, RT-ddPCR, and next-generation sequencing. We screened 1,035 samples positive for SARS-CoV-2 by our CDC-based laboratory developed assay using ThermoFishers multiplex RT-PCR COVID-19 assay over four weeks from late December 2020 to early January 2021. S gene dropout candidates were subsequently assayed by RT-ddPCR to confirm four mutations within the S gene associated with the B.1.1.7 lineage: a deletion at amino acid (AA) 69-70 (ACATGT), deletion at AA 145, (TTA), N501Y mutation (TAT), and S982A mutation (GCA). All four targets were detected in two specimens, and follow-up sequencing revealed a total of 10 mutations in the S gene and phylogenetic clustering within the B.1.1.7 lineage. As variants of concern become increasingly prevalent, molecular diagnostic tools like RT-ddPCR can be utilized to quickly, accurately, and sensitively distinguish more contagious lineages of SARS-CoV-2.
Subject(s)
COVID-19ABSTRACT
In October 2020, an outbreak of at least 50 COVID-19 cases was reported surrounding individuals employed at or visiting the White House. Here, we applied genomic epidemiology to investigate the origins of this outbreak. We enrolled two individuals with exposures linked to the White House COVID-19 outbreak into an IRB-approved research study and sequenced their SARS-CoV-2 infections. We find these viral sequences are highly genetically similar to each other, but are distinct from over 160,000 publicly available SARS-CoV-2 genomes, possessing 5 nucleotide mutations that differentiate this lineage from all other circulating lineages sequenced to date. We estimate this lineage has a common ancestor in the USA in April or May 2020, but its whereabouts for the past 5 to 6 months are not clear. Looking forwards, sequencing of additional community SARS-CoV-2 infections collected in the USA prior to October 2020 may reveal linked infections and shed light on its geographic ancestry. In sequencing of SARS-CoV-2 infections collected after October 2020, the relative rarity of this constellation of mutations may make it possible to identify infections that likely descend from the White House COVID-19 outbreak.
Subject(s)
COVID-19 , Severe Acute Respiratory Syndrome , InfectionsABSTRACT
Seasonal coronaviruses (OC43, 229E, NL63 and HKU1) are endemic to the human population, regularly infecting and reinfecting humans while typically causing asymptomatic to mild respiratory infections. It is not known to what extent reinfection by these viruses is due to waning immune memory or antigenic drift of the viruses. Here, we address the influence of antigenic drift on immune evasion of seasonal coronaviruses. We provide evidence that at least two of these viruses, OC43 and 229E, are undergoing adaptive evolution in regions of the viral spike protein that are exposed to human humoral immunity. This suggests that reinfection may be due, in part, to positively-selected genetic changes in these viruses that enable them to escape recognition by the immune system. It is possible that, as with seasonal influenza, these adaptive changes in antigenic regions of the virus would necessitate continual reformulation of a vaccine made against them.
Subject(s)
Respiratory Tract InfectionsABSTRACT
The rapid spread of SARS-CoV-2 has gravely impacted societies around the world. Outbreaks in different parts of the globe are shaped by repeated introductions of new lineages and subsequent local transmission of those lineages. Here, we sequenced 3940 SARS-CoV-2 viral genomes from Washington State to characterize how the spread of SARS-CoV-2 in Washington State (USA) was shaped by differences in timing of mitigation strategies across counties, as well as by repeated introductions of viral lineages into the state. Additionally, we show that the increase in frequency of a potentially more transmissible viral variant (614G) over time can potentially be explained by regional mobility differences and multiple introductions of 614G, but not the other variant (614D) into the state. At an individual level, we see evidence of higher viral loads in patients infected with the 614G variant. However, using clinical records data, we do not find any evidence that the 614G variant impacts clinical severity or patient outcomes. Overall, this suggests that at least to date, the behavior of individuals has been more important in shaping the course of the pandemic than changes in the virus.